Scientific Data
Top medRxiv preprints most likely to be published in this journal, ranked by match strength.
Show abstract
Background and Purpose: Magnetic resonance imaging (MRI) for radiation therapy treatment planning is currently being used in many anatomical sites to better visualize soft tissue landmarks, a technique known as an MRI simulation. A core component of modern MRI simulation configurations are the use of external laser positioning systems (ELPS) to help set up the patient. Though necessary for accurate and reproducible patient setup, the ELPS, if left on during imaging, may interfere negatively with...
Show abstract
Immune-related adverse events (irAEs) affect up to 40% of patients receiving immune checkpoint inhibitors, yet their identification depends on laborious and inconsistent manual chart review. Here we developed and evaluated an agentic large language model system to extract the presence, temporality, severity grade, attribution, and certainty of six irAE types from clinical notes. Retrospectively (263 notes), the system achieved macro-averaged F1 of 0.92 for detection and 0.66 for multi-class seve...
Show abstract
BackgroundRetrieval-augmented generation (RAG) frameworks such as RAPID [1] have demonstrated that staged planning and retrieval grounding improve long-form text generation. However, most implementations remain similarity-driven and open-domain, lacking the epistemic safeguards required for biomedical synthesis, where mechanistic completeness, temporal governance, traceability, and explicit gap classification are essential. ObjectiveTo develop and evaluate a topology-aware, graph-augmented retr...
Show abstract
BackgroundAlthough deep learning models have improved individual PET analysis, image processing and quantification tasks, end-to-end automation from raw DICOM to quantitative clinical reporting remains limited, particularly in heterogeneous real-world settings. MethodsAs a proof-of-concept, an autonomous large language model (LLM)-orchestrated multi-tool agent for end-to-end PET/CT interpretation was developed. A reasoning-based text LLM selected appropriate series from raw DICOM, coordinated r...
Show abstract
BackgroundMotor threshold (MT) estimation is fundamental to transcranial magnetic stimulation (TMS), guiding individualized stimulation intensity in research and therapy. Conventional methods such as the 5-out-of-10 rule require many stimuli, while adaptive approaches like Parameter Estimation by Sequential Testing (PEST) improve efficiency but can exhibit poor convergence under certain conditions. ObjectiveThis study introduces the Bayesian Uncertainty Dynamic Algorithm for Parameter Estimatio...
Show abstract
Current standard of care imaging practices cannot reliably differentiate among certain renal tumors such as benign oncocytoma and clear cell renal cell carcinoma (RCC), and between low and high grade RCCs. Previous work has explored using deep learning, radiomics, and texture analysis to predict renal tumor subtypes and differentiate between low and high grade RCCs with mixed success. To further this work, large diverse datasets are needed to improve model performance and provide strong evaluati...
Show abstract
Deep-learning based super-resolution has shown promise for enhancing the spatial resolution of brain magnetic resonance images, which may help visualize small anatomical structures more clearly. However, when only limited training data are available, it remains uncertain which model assessment method provides the most reliable estimate of out-of-sample performance. In this study, three widely used assessment strategies (three-way holdout, k-fold cross-validation, and nested cross-validation) wer...
Show abstract
There are many alternative methods to joystick control for control of Electric Powered Wheelchairs for users with neuromuscular disabilities, such as muscular dystrophy, and spinal cord injuries, such as tetraplegia. However, these methods- which include the sip-and-puff method, head and neck movement, blinking, or tongue movement- hinder social interaction, and are therefore detrimental to user independence. In recent years, research has explored the use of Electromyography (EMG) signals from a...
Show abstract
Artificial intelligence (AI) based segmentation has many medical applications but limited curated datasets challenge model training; this study compares the impact of dataset annotation quality and quantity on whole liver AI segmentation performance. We obtained 3,089 abdominal computed tomography scans with whole-liver contours from MD Anderson Cancer Center (MDA) and a MICCAI challenge. A total of 249 scans were withheld for testing of which 30, MICCAI challenge data, were reserved for externa...
Show abstract
PurposeNeonatal imaging is particularly challenging because newborns have a high likelihood of head motion, which can degrade image quality and complicate interpretation. Improving MRI brain image quality may help reduce diagnostic uncertainty and facilitate the nuanced assessment of early myelinating structures in the neonatal brain. Although deep learning reconstruction algorithms designed to improve MRI image quality have been evaluated in pediatric imaging, they have not been specifically st...
Show abstract
Chemical shift-encoded magnetic resonance imaging using high-resolved 3D Dixon techniques enables the non-invasive and radiation-free assessment of whole-body adipose tissue and ectopic fat distribution. Automatic deep learning-based segmentation of metabolically relevant adipose tissue compartments and ectopic fat deposits in parenchymal tissue is the most important image processing step for the quantification of adipose tissue volumes and ectopic fat percentages from whole-body imaging. This ...
Show abstract
Brain tumors are one of the most life-threatening diseases, requiring precise and timely detection for effective treatment. Traditional methods for brain tumor detection rely heavily on manual analysis of MRI scans, which is time-consuming, subjective, and prone to human error. With advancements in deep learning, Convolutional Neural Networks (CNNs) have become popular for medical image analysis. However, CNNs are limited in their ability to capture spatial hierarchies and pose variations, which...
Show abstract
In previous work, we achieved state-of-the-art performance on ChestX-ray14 (ROC-AUC 0.940, F1 0.821) using pretraining diversity and clinical metric optimization. Applying the same methodology to CheXpert, we received similar results when using NLP valuation and test data--but when evaluated against expert radiologist labels, performance was only 0.75-0.87 ROC-AUC. The models had learned to match the automated NLP labeling system, not to diagnose disease. This paper documents our investigation ...
Show abstract
Rationale and ObjectivesContrast-enhanced (CE) MRI provides clear corticomedullary contrast for renal compartment delineation but may be contraindicated or undesirable in routine practice. We aimed to enable automated extraction of renal imaging biomarkers from routine non-contrast-enhanced (NCE) T1-weighted MRI by transferring CE-derived compartment labels. Materials and MethodsThis retrospective single-center study (January 2017 to December 2021) included 200 participants with paired arterial...
Show abstract
Digital breast tomosynthesis (DBT) is a powerful imaging modality that allows for improved lesion visibility, characterization, and localization compared to conventional two-dimensional digital mammography. DBT has been increasingly adopted in screening and diagnostic settings globally, particularly for women with dense breast tissue where tissue overlap presents a significant diagnostic challenge. Here we describe DBT-2026, a real world imaging dataset with 558 DBT exams from 558 patients with ...
Show abstract
Sodium (23Na) magnetic resonance imaging (MRI) provides valuable metabolic information, but it is limited by a low signal-to-noise ratio (SNR) and long acquisition times. To overcome these challenges, we present a Deep Image Prior (DIP)-based framework that combines anatomically guided proton (1H) MRI and metabolically guided 23Na MRI denoising via a fused proton-sodium prior within a directional total variation (dTV) regularization scheme. The DIP-Fusion approach minimizes a variational loss fu...
Show abstract
BackgroundBreast cancer-related lymphedema (BCRL) is a common complication following breast cancer treatment. While lymphoscintigraphy is considered the diagnostic gold standard, it is unsuitable for routine periodic monitoring or assessment of treatment efficacy. Shear wave elastography (SWE) offers a possible alternative, but traditional modes of operation limit its potential. Proposed SolutionsThe Holder-Optimized Elastography (HOE) method is introduced to eliminate pressure issues introduce...
Show abstract
Dense breast tissue diminishes the sensitivity of mammographic screening and is a key cancer risk factor, which motivates accurate segmentation under scarce and expensive expert annotations in the medical imaging domain. Here, we benchmark the effect of backbone architecture, self-supervised pre-training (SSL), fine-tuning strategy, and loss design for dense-tissue segmentation on a small expert-labeled dataset (596 images) and an in-domain unlabeled corpus (20, 000 images), reflecting the lack ...
Show abstract
Non-alcoholic fatty liver disease (NAFLD) is a globally prevalent hepatic condition caused by the buildup of fat in the liver. It is frequently associated with metabolic comorbidities such as hypertension, cardiovascular disease (CVD), and prediabetes. However, early detection remains challenging due to the asymptomatic progression, and existing primary diagnostic methods, such as imaging or liver biopsy, are often expensive and inaccessible in rural areas. This study proposes a two-stage, inter...
Show abstract
Causal discovery algorithms are often leveraged for inferring causal relationships and recovering a causal model from data. However, causal discovery from data alone is limited by the structural constraints of the used dataset, the lack of causal logic, and the lack of external knowledge. Thus, data-driven causal discovery can only suggest possible causal relationships at best. To overcome these limitations, Large Language Models (LLMs) and knowledge systems, such as Retrieval-Augmented Generati...